Dataset statistics
| Number of variables | 12 |
|---|---|
| Number of observations | 98818 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 14.1 MiB |
| Average record size in memory | 149.8 B |
Variable types
| Numeric | 11 |
|---|---|
| Categorical | 1 |
df_index is highly correlated with friend_count and 1 other fields | High correlation |
friend_count is highly correlated with df_index and 1 other fields | High correlation |
friendships_initiated is highly correlated with df_index and 1 other fields | High correlation |
likes is highly correlated with mobile_likes and 1 other fields | High correlation |
likes_received is highly correlated with mobile_likes_received and 1 other fields | High correlation |
mobile_likes is highly correlated with likes | High correlation |
mobile_likes_received is highly correlated with likes_received and 1 other fields | High correlation |
www_likes is highly correlated with likes | High correlation |
www_likes_received is highly correlated with likes_received and 1 other fields | High correlation |
df_index is highly correlated with friend_count and 4 other fields | High correlation |
friend_count is highly correlated with df_index and 4 other fields | High correlation |
friendships_initiated is highly correlated with df_index and 3 other fields | High correlation |
likes is highly correlated with likes_received and 4 other fields | High correlation |
likes_received is highly correlated with df_index and 6 other fields | High correlation |
mobile_likes is highly correlated with likes and 3 other fields | High correlation |
mobile_likes_received is highly correlated with df_index and 6 other fields | High correlation |
www_likes is highly correlated with likes and 1 other fields | High correlation |
www_likes_received is highly correlated with df_index and 6 other fields | High correlation |
df_index is highly correlated with friend_count and 1 other fields | High correlation |
friend_count is highly correlated with df_index and 1 other fields | High correlation |
friendships_initiated is highly correlated with df_index and 1 other fields | High correlation |
likes is highly correlated with likes_received and 3 other fields | High correlation |
likes_received is highly correlated with likes and 3 other fields | High correlation |
mobile_likes is highly correlated with likes and 2 other fields | High correlation |
mobile_likes_received is highly correlated with likes and 3 other fields | High correlation |
www_likes_received is highly correlated with likes and 2 other fields | High correlation |
www_likes is highly correlated with likes | High correlation |
friendships_initiated is highly correlated with df_index and 1 other fields | High correlation |
df_index is highly correlated with friendships_initiated and 1 other fields | High correlation |
mobile_likes is highly correlated with likes | High correlation |
friend_count is highly correlated with friendships_initiated and 1 other fields | High correlation |
likes is highly correlated with www_likes and 1 other fields | High correlation |
likes_received is highly correlated with mobile_likes_received and 1 other fields | High correlation |
mobile_likes_received is highly correlated with likes_received and 1 other fields | High correlation |
www_likes_received is highly correlated with likes_received and 1 other fields | High correlation |
likes_received is highly skewed (γ1 = 112.0109727) | Skewed |
mobile_likes_received is highly skewed (γ1 = 107.4678309) | Skewed |
www_likes_received is highly skewed (γ1 = 126.1856832) | Skewed |
df_index is uniformly distributed | Uniform |
df_index has unique values | Unique |
friend_count has 1956 (2.0%) zeros | Zeros |
friendships_initiated has 2988 (3.0%) zeros | Zeros |
likes has 22277 (22.5%) zeros | Zeros |
likes_received has 24392 (24.7%) zeros | Zeros |
mobile_likes has 34994 (35.4%) zeros | Zeros |
mobile_likes_received has 29956 (30.3%) zeros | Zeros |
www_likes has 60927 (61.7%) zeros | Zeros |
www_likes_received has 36817 (37.3%) zeros | Zeros |
Reproduction
| Analysis started | 2021-08-17 07:02:49.834101 |
|---|---|
| Analysis finished | 2021-08-17 07:03:14.957965 |
| Duration | 25.12 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
df_index
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONUNIFORMUNIQUE| Distinct | 98818 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49501.35276 |
| Minimum | 0 |
|---|---|
| Maximum | 99002 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 772.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4950.85 |
| Q1 | 24747.25 |
| median | 49506.5 |
| Q3 | 74253.75 |
| 95-th percentile | 94054.15 |
| Maximum | 99002 |
| Range | 99002 |
| Interquartile range (IQR) | 49506.5 |
Descriptive statistics
| Standard deviation | 28580.82261 |
|---|---|
| Coefficient of variation (CV) | 0.5773745769 |
| Kurtosis | -1.200157688 |
| Mean | 49501.35276 |
| Median Absolute Deviation (MAD) | 24753.5 |
| Skewness | -8.232036957 × 10-5 |
| Sum | 4891624677 |
| Variance | 816863420.8 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 85379 | 1 | < 0.1% |
| 7529 | 1 | < 0.1% |
| 5480 | 1 | < 0.1% |
| 28007 | 1 | < 0.1% |
| 25958 | 1 | < 0.1% |
| 32101 | 1 | < 0.1% |
| 30052 | 1 | < 0.1% |
| 19811 | 1 | < 0.1% |
| 17762 | 1 | < 0.1% |
| Other values (98808) | 98808 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 99002 | 1 | |
| 99001 | 1 | |
| 99000 | 1 | |
| 98999 | 1 | |
| 98998 | 1 | |
| 98997 | 1 | |
| 98996 | 1 | |
| 98995 | 1 | |
| 98994 | 1 | |
| 98993 | 1 |
age
Real number (ℝ≥0)
| Distinct | 83 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 33.89545427 |
| Minimum | 13 |
|---|---|
| Maximum | 95 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 772.1 KiB |
Quantile statistics
| Minimum | 13 |
|---|---|
| 5-th percentile | 15 |
| Q1 | 20 |
| median | 28 |
| Q3 | 45 |
| 95-th percentile | 68 |
| Maximum | 95 |
| Range | 82 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 17.35955148 |
|---|---|
| Coefficient of variation (CV) | 0.5121498399 |
| Kurtosis | 0.3833371448 |
| Mean | 33.89545427 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 1.082567963 |
| Sum | 3349481 |
| Variance | 301.3540275 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 28 | 6621 | 6.7% |
| 18 | 5196 | 5.3% |
| 23 | 4401 | 4.5% |
| 19 | 4390 | 4.4% |
| 20 | 3768 | 3.8% |
| 21 | 3670 | 3.7% |
| 25 | 3631 | 3.7% |
| 17 | 3281 | 3.3% |
| 16 | 3086 | 3.1% |
| 22 | 3032 | 3.1% |
| Other values (73) | 57742 |
| Value | Count | Frequency (%) |
| 13 | 484 | 0.5% |
| 14 | 1925 | 1.9% |
| 15 | 2617 | |
| 16 | 3086 | |
| 17 | 3281 | |
| 18 | 5196 | |
| 19 | 4390 | |
| 20 | 3768 | |
| 21 | 3670 | |
| 22 | 3032 |
| Value | Count | Frequency (%) |
| 95 | 76 | 0.1% |
| 94 | 181 | |
| 93 | 202 | |
| 92 | 50 | 0.1% |
| 91 | 74 | 0.1% |
| 90 | 68 | 0.1% |
| 89 | 58 | 0.1% |
| 88 | 60 | 0.1% |
| 87 | 41 | < 0.1% |
| 86 | 75 | 0.1% |
gender
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.8 MiB |
| male | |
|---|---|
| female |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.814669392 |
| Min length | 4 |
Characters and Unicode
| Total characters | 475776 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | male |
|---|---|
| 2nd row | female |
| 3rd row | male |
| 4th row | female |
| 5th row | male |
Common Values
| Value | Count | Frequency (%) |
| male | 58566 | |
| female | 40252 |
Length
Pie chart
| Value | Count | Frequency (%) |
| male | 58566 | |
| female | 40252 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 139070 | |
| m | 98818 | |
| a | 98818 | |
| l | 98818 | |
| f | 40252 | 8.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 475776 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 139070 | |
| m | 98818 | |
| a | 98818 | |
| l | 98818 | |
| f | 40252 | 8.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 475776 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 139070 | |
| m | 98818 | |
| a | 98818 | |
| l | 98818 | |
| f | 40252 | 8.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 475776 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 139070 | |
| m | 98818 | |
| a | 98818 | |
| l | 98818 | |
| f | 40252 | 8.5% |
tenure
Real number (ℝ≥0)
| Distinct | 2418 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 535.6817887 |
| Minimum | 0 |
|---|---|
| Maximum | 3139 |
| Zeros | 69 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 772.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 47 |
| Q1 | 226 |
| median | 412 |
| Q3 | 673 |
| 95-th percentile | 1567 |
| Maximum | 3139 |
| Range | 3139 |
| Interquartile range (IQR) | 447 |
Descriptive statistics
| Standard deviation | 454.2589293 |
|---|---|
| Coefficient of variation (CV) | 0.8480014419 |
| Kurtosis | 2.195246589 |
| Mean | 535.6817887 |
| Median Absolute Deviation (MAD) | 212 |
| Skewness | 1.530829395 |
| Sum | 52935003 |
| Variance | 206351.1748 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 300 | 173 | 0.2% |
| 303 | 170 | 0.2% |
| 242 | 163 | 0.2% |
| 272 | 163 | 0.2% |
| 257 | 161 | 0.2% |
| 297 | 161 | 0.2% |
| 280 | 160 | 0.2% |
| 285 | 160 | 0.2% |
| 284 | 158 | 0.2% |
| 278 | 158 | 0.2% |
| Other values (2408) | 97191 |
| Value | Count | Frequency (%) |
| 0 | 69 | |
| 1 | 60 | |
| 2 | 72 | |
| 3 | 79 | |
| 4 | 86 | |
| 5 | 92 | |
| 6 | 93 | |
| 7 | 84 | |
| 8 | 87 | |
| 9 | 93 |
| Value | Count | Frequency (%) |
| 3139 | 3 | |
| 3129 | 1 | < 0.1% |
| 3128 | 1 | < 0.1% |
| 3101 | 1 | < 0.1% |
| 3019 | 1 | < 0.1% |
| 2958 | 1 | < 0.1% |
| 2926 | 1 | < 0.1% |
| 2888 | 1 | < 0.1% |
| 2822 | 1 | < 0.1% |
| 2788 | 1 | < 0.1% |
friend_count
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 2561 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 196.3898986 |
| Minimum | 0 |
|---|---|
| Maximum | 4923 |
| Zeros | 1956 |
| Zeros (%) | 2.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 772.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 31 |
| median | 82 |
| Q3 | 206 |
| 95-th percentile | 720 |
| Maximum | 4923 |
| Range | 4923 |
| Interquartile range (IQR) | 175 |
Descriptive statistics
| Standard deviation | 387.4751451 |
|---|---|
| Coefficient of variation (CV) | 1.972989181 |
| Kurtosis | 50.08132589 |
| Mean | 196.3898986 |
| Median Absolute Deviation (MAD) | 64 |
| Skewness | 6.058962436 |
| Sum | 19406857 |
| Variance | 150136.9881 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1956 | 2.0% |
| 1 | 1814 | 1.8% |
| 2 | 1115 | 1.1% |
| 3 | 860 | 0.9% |
| 5 | 785 | 0.8% |
| 4 | 747 | 0.8% |
| 10 | 737 | 0.7% |
| 24 | 732 | 0.7% |
| 6 | 720 | 0.7% |
| 29 | 718 | 0.7% |
| Other values (2551) | 88634 |
| Value | Count | Frequency (%) |
| 0 | 1956 | |
| 1 | 1814 | |
| 2 | 1115 | |
| 3 | 860 | |
| 4 | 747 | 0.8% |
| 5 | 785 | |
| 6 | 720 | 0.7% |
| 7 | 670 | 0.7% |
| 8 | 718 | 0.7% |
| 9 | 698 | 0.7% |
| Value | Count | Frequency (%) |
| 4923 | 1 | |
| 4917 | 1 | |
| 4863 | 1 | |
| 4845 | 1 | |
| 4844 | 1 | |
| 4826 | 1 | |
| 4817 | 1 | |
| 4803 | 1 | |
| 4797 | 1 | |
| 4794 | 1 |
friendships_initiated
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 1519 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 107.4887268 |
| Minimum | 0 |
|---|---|
| Maximum | 4144 |
| Zeros | 2988 |
| Zeros (%) | 3.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 772.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 17 |
| median | 46 |
| Q3 | 117 |
| 95-th percentile | 418 |
| Maximum | 4144 |
| Range | 4144 |
| Interquartile range (IQR) | 100 |
Descriptive statistics
| Standard deviation | 188.8667665 |
|---|---|
| Coefficient of variation (CV) | 1.757084414 |
| Kurtosis | 42.52974172 |
| Mean | 107.4887268 |
| Median Absolute Deviation (MAD) | 36 |
| Skewness | 5.151078708 |
| Sum | 10621821 |
| Variance | 35670.65547 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2988 | 3.0% |
| 1 | 2209 | 2.2% |
| 2 | 1546 | 1.6% |
| 3 | 1354 | 1.4% |
| 4 | 1348 | 1.4% |
| 6 | 1325 | 1.3% |
| 5 | 1325 | 1.3% |
| 11 | 1317 | 1.3% |
| 8 | 1312 | 1.3% |
| 13 | 1276 | 1.3% |
| Other values (1509) | 82818 |
| Value | Count | Frequency (%) |
| 0 | 2988 | |
| 1 | 2209 | |
| 2 | 1546 | |
| 3 | 1354 | |
| 4 | 1348 | |
| 5 | 1325 | |
| 6 | 1325 | |
| 7 | 1234 | |
| 8 | 1312 | |
| 9 | 1243 |
| Value | Count | Frequency (%) |
| 4144 | 1 | |
| 3654 | 1 | |
| 3594 | 1 | |
| 3538 | 1 | |
| 3415 | 1 | |
| 3238 | 1 | |
| 3233 | 1 | |
| 3086 | 1 | |
| 3078 | 1 | |
| 3024 | 1 |
| Distinct | 2921 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 156.1243498 |
| Minimum | 0 |
|---|---|
| Maximum | 25111 |
| Zeros | 22277 |
| Zeros (%) | 22.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 772.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 11 |
| Q3 | 81 |
| 95-th percentile | 726.15 |
| Maximum | 25111 |
| Range | 25111 |
| Interquartile range (IQR) | 80 |
Descriptive statistics
| Standard deviation | 572.5748897 |
|---|---|
| Coefficient of variation (CV) | 3.667428497 |
| Kurtosis | 200.3880019 |
| Mean | 156.1243498 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 11.02377153 |
| Sum | 15427896 |
| Variance | 327842.0043 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 22277 | |
| 1 | 6916 | 7.0% |
| 2 | 4428 | 4.5% |
| 3 | 3235 | 3.3% |
| 4 | 2503 | 2.5% |
| 5 | 2025 | 2.0% |
| 6 | 1804 | 1.8% |
| 7 | 1615 | 1.6% |
| 8 | 1430 | 1.4% |
| 9 | 1379 | 1.4% |
| Other values (2911) | 51206 |
| Value | Count | Frequency (%) |
| 0 | 22277 | |
| 1 | 6916 | 7.0% |
| 2 | 4428 | 4.5% |
| 3 | 3235 | 3.3% |
| 4 | 2503 | 2.5% |
| 5 | 2025 | 2.0% |
| 6 | 1804 | 1.8% |
| 7 | 1615 | 1.6% |
| 8 | 1430 | 1.4% |
| 9 | 1379 | 1.4% |
| Value | Count | Frequency (%) |
| 25111 | 1 | |
| 21652 | 1 | |
| 16732 | 1 | |
| 16583 | 1 | |
| 14799 | 1 | |
| 14355 | 1 | |
| 14050 | 1 | |
| 14039 | 1 | |
| 13692 | 1 | |
| 13622 | 1 |
likes_received
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONSKEWEDZEROS| Distinct | 2675 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 142.6769414 |
| Minimum | 0 |
|---|---|
| Maximum | 261197 |
| Zeros | 24392 |
| Zeros (%) | 24.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 772.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 8 |
| Q3 | 59 |
| 95-th percentile | 561 |
| Maximum | 261197 |
| Range | 261197 |
| Interquartile range (IQR) | 58 |
Descriptive statistics
| Standard deviation | 1389.045639 |
|---|---|
| Coefficient of variation (CV) | 9.735600052 |
| Kurtosis | 17361.07811 |
| Mean | 142.6769414 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 112.0109727 |
| Sum | 14099050 |
| Variance | 1929447.786 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 24392 | |
| 1 | 7291 | 7.4% |
| 2 | 4537 | 4.6% |
| 3 | 3342 | 3.4% |
| 4 | 2663 | 2.7% |
| 5 | 2367 | 2.4% |
| 6 | 1868 | 1.9% |
| 7 | 1678 | 1.7% |
| 8 | 1535 | 1.6% |
| 9 | 1349 | 1.4% |
| Other values (2665) | 47796 |
| Value | Count | Frequency (%) |
| 0 | 24392 | |
| 1 | 7291 | 7.4% |
| 2 | 4537 | 4.6% |
| 3 | 3342 | 3.4% |
| 4 | 2663 | 2.7% |
| 5 | 2367 | 2.4% |
| 6 | 1868 | 1.9% |
| 7 | 1678 | 1.7% |
| 8 | 1535 | 1.6% |
| 9 | 1349 | 1.4% |
| Value | Count | Frequency (%) |
| 261197 | 1 | |
| 178166 | 1 | |
| 152014 | 1 | |
| 106025 | 1 | |
| 82623 | 1 | |
| 53534 | 1 | |
| 52964 | 1 | |
| 45633 | 1 | |
| 42449 | 1 | |
| 39536 | 1 |
mobile_likes
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 2394 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 106.1564391 |
| Minimum | 0 |
|---|---|
| Maximum | 25111 |
| Zeros | 34994 |
| Zeros (%) | 35.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 772.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 4 |
| Q3 | 46 |
| 95-th percentile | 482 |
| Maximum | 25111 |
| Range | 25111 |
| Interquartile range (IQR) | 46 |
Descriptive statistics
| Standard deviation | 445.511712 |
|---|---|
| Coefficient of variation (CV) | 4.19674695 |
| Kurtosis | 360.808137 |
| Mean | 106.1564391 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 14.16034567 |
| Sum | 10490167 |
| Variance | 198480.6856 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 34994 | |
| 1 | 6287 | 6.4% |
| 2 | 3930 | 4.0% |
| 3 | 2910 | 2.9% |
| 4 | 2262 | 2.3% |
| 5 | 1790 | 1.8% |
| 6 | 1597 | 1.6% |
| 7 | 1395 | 1.4% |
| 8 | 1210 | 1.2% |
| 9 | 1148 | 1.2% |
| Other values (2384) | 41295 |
| Value | Count | Frequency (%) |
| 0 | 34994 | |
| 1 | 6287 | 6.4% |
| 2 | 3930 | 4.0% |
| 3 | 2910 | 2.9% |
| 4 | 2262 | 2.3% |
| 5 | 1790 | 1.8% |
| 6 | 1597 | 1.6% |
| 7 | 1395 | 1.4% |
| 8 | 1210 | 1.2% |
| 9 | 1148 | 1.2% |
| Value | Count | Frequency (%) |
| 25111 | 1 | |
| 21652 | 1 | |
| 16732 | 1 | |
| 14039 | 1 | |
| 13529 | 1 | |
| 12934 | 1 | |
| 12639 | 1 | |
| 12104 | 1 | |
| 12083 | 1 | |
| 11959 | 1 |
mobile_likes_received
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONSKEWEDZEROS| Distinct | 2002 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 84.12564513 |
| Minimum | 0 |
|---|---|
| Maximum | 138561 |
| Zeros | 29956 |
| Zeros (%) | 30.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 772.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 4 |
| Q3 | 33 |
| 95-th percentile | 317 |
| Maximum | 138561 |
| Range | 138561 |
| Interquartile range (IQR) | 33 |
Descriptive statistics
| Standard deviation | 840.577049 |
|---|---|
| Coefficient of variation (CV) | 9.991923958 |
| Kurtosis | 15500.8792 |
| Mean | 84.12564513 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 107.4678309 |
| Sum | 8313128 |
| Variance | 706569.7754 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 29956 | |
| 1 | 8227 | 8.3% |
| 2 | 4942 | 5.0% |
| 3 | 3598 | 3.6% |
| 4 | 2936 | 3.0% |
| 5 | 2382 | 2.4% |
| 6 | 2017 | 2.0% |
| 7 | 1744 | 1.8% |
| 8 | 1520 | 1.5% |
| 9 | 1433 | 1.5% |
| Other values (1992) | 40063 |
| Value | Count | Frequency (%) |
| 0 | 29956 | |
| 1 | 8227 | 8.3% |
| 2 | 4942 | 5.0% |
| 3 | 3598 | 3.6% |
| 4 | 2936 | 3.0% |
| 5 | 2382 | 2.4% |
| 6 | 2017 | 2.0% |
| 7 | 1744 | 1.8% |
| 8 | 1520 | 1.5% |
| 9 | 1433 | 1.5% |
| Value | Count | Frequency (%) |
| 138561 | 1 | |
| 131244 | 1 | |
| 89911 | 1 | |
| 73333 | 1 | |
| 43410 | 1 | |
| 30754 | 1 | |
| 30387 | 1 | |
| 27353 | 1 | |
| 20770 | 1 | |
| 18925 | 1 |
| Distinct | 1724 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49.9679107 |
| Minimum | 0 |
|---|---|
| Maximum | 14865 |
| Zeros | 60927 |
| Zeros (%) | 61.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 772.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 7 |
| 95-th percentile | 208 |
| Maximum | 14865 |
| Range | 14865 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 285.7627019 |
|---|---|
| Coefficient of variation (CV) | 5.718924362 |
| Kurtosis | 448.7068585 |
| Mean | 49.9679107 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 16.90545084 |
| Sum | 4937729 |
| Variance | 81660.32178 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 60927 | |
| 1 | 4678 | 4.7% |
| 2 | 2750 | 2.8% |
| 3 | 1945 | 2.0% |
| 4 | 1415 | 1.4% |
| 5 | 1201 | 1.2% |
| 6 | 1075 | 1.1% |
| 7 | 895 | 0.9% |
| 8 | 790 | 0.8% |
| 9 | 755 | 0.8% |
| Other values (1714) | 22387 | 22.7% |
| Value | Count | Frequency (%) |
| 0 | 60927 | |
| 1 | 4678 | 4.7% |
| 2 | 2750 | 2.8% |
| 3 | 1945 | 2.0% |
| 4 | 1415 | 1.4% |
| 5 | 1201 | 1.2% |
| 6 | 1075 | 1.1% |
| 7 | 895 | 0.9% |
| 8 | 790 | 0.8% |
| 9 | 755 | 0.8% |
| Value | Count | Frequency (%) |
| 14865 | 1 | |
| 12903 | 1 | |
| 11077 | 1 | |
| 10763 | 1 | |
| 10627 | 1 | |
| 10539 | 1 | |
| 10255 | 1 | |
| 10232 | 1 | |
| 9902 | 1 | |
| 9431 | 1 |
www_likes_received
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONSKEWEDZEROS| Distinct | 1634 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 58.55129632 |
| Minimum | 0 |
|---|---|
| Maximum | 129953 |
| Zeros | 36817 |
| Zeros (%) | 37.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 772.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 20 |
| 95-th percentile | 227 |
| Maximum | 129953 |
| Range | 129953 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 601.9046288 |
|---|---|
| Coefficient of variation (CV) | 10.27995393 |
| Kurtosis | 23779.52219 |
| Mean | 58.55129632 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 126.1856832 |
| Sum | 5785922 |
| Variance | 362289.1822 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 36817 | |
| 1 | 8497 | 8.6% |
| 2 | 5096 | 5.2% |
| 3 | 3582 | 3.6% |
| 4 | 2823 | 2.9% |
| 5 | 2313 | 2.3% |
| 6 | 1916 | 1.9% |
| 7 | 1596 | 1.6% |
| 8 | 1442 | 1.5% |
| 9 | 1369 | 1.4% |
| Other values (1624) | 33367 |
| Value | Count | Frequency (%) |
| 0 | 36817 | |
| 1 | 8497 | 8.6% |
| 2 | 5096 | 5.2% |
| 3 | 3582 | 3.6% |
| 4 | 2823 | 2.9% |
| 5 | 2313 | 2.3% |
| 6 | 1916 | 1.9% |
| 7 | 1596 | 1.6% |
| 8 | 1442 | 1.5% |
| 9 | 1369 | 1.4% |
| Value | Count | Frequency (%) |
| 129953 | 1 | |
| 62103 | 1 | |
| 39605 | 1 | |
| 39213 | 1 | |
| 34039 | 1 | |
| 32692 | 1 | |
| 29337 | 1 | |
| 23147 | 1 | |
| 22644 | 1 | |
| 15096 | 1 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| df_index | age | gender | tenure | friend_count | friendships_initiated | likes | likes_received | mobile_likes | mobile_likes_received | www_likes | www_likes_received | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 14 | male | 266.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 1 | 1 | 14 | female | 6.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 2 | 2 | 14 | male | 13.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 3 | 3 | 14 | female | 93.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 4 | 4 | 14 | male | 82.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 5 | 5 | 14 | male | 15.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 6 | 6 | 13 | male | 12.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 7 | 7 | 13 | female | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 8 | 8 | 13 | male | 81.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 9 | 9 | 13 | male | 171.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Last rows
| df_index | age | gender | tenure | friend_count | friendships_initiated | likes | likes_received | mobile_likes | mobile_likes_received | www_likes | www_likes_received | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 98808 | 98993 | 19 | male | 394.0 | 4538 | 4144 | 4501 | 15088 | 4435 | 5961 | 66 | 9127 |
| 98809 | 98994 | 20 | female | 402.0 | 1988 | 332 | 7351 | 106025 | 7248 | 73333 | 103 | 32692 |
| 98810 | 98995 | 20 | female | 699.0 | 3611 | 973 | 4507 | 7768 | 4414 | 6909 | 93 | 859 |
| 98811 | 98996 | 24 | female | 182.0 | 2938 | 1272 | 6018 | 17765 | 5843 | 11708 | 175 | 6057 |
| 98812 | 98997 | 28 | female | 290.0 | 2218 | 1618 | 4626 | 10268 | 4290 | 4250 | 336 | 6018 |
| 98813 | 98998 | 68 | female | 541.0 | 2118 | 341 | 3996 | 18089 | 3505 | 11887 | 491 | 6202 |
| 98814 | 98999 | 18 | female | 21.0 | 1968 | 1720 | 4401 | 13412 | 4399 | 10592 | 2 | 2820 |
| 98815 | 99000 | 15 | female | 111.0 | 2002 | 1524 | 11959 | 12554 | 11959 | 11462 | 0 | 1092 |
| 98816 | 99001 | 23 | female | 416.0 | 2560 | 185 | 4506 | 6516 | 4506 | 5760 | 0 | 756 |
| 98817 | 99002 | 39 | female | 397.0 | 2049 | 768 | 9410 | 12443 | 9410 | 9530 | 0 | 2913 |